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Abstract 

This paper studies the tension between throughput and decoding delay performance of two widely-used 
network coding schemes: random linear network coding (RLNC) and instantly decodable network coding (IDNC). 
A single-hop broadcasting system model is considered that aims to deliver a block of packets to all receivers in 
the presence of packet erasures. For a fair and analytically tractable comparison between the two coding schemes, 
the transmission comprises two phases: a systematic transmission phase and a network coded transmission phase 
which is further divided into rounds. After the systematic transmission phase and given the same packet reception 
state, three quantitative metrics are proposed and derived in each scheme: 1) the absolute minimum number of 
transmissions in the first coded transmission round (assuming no erasures), 2) probability distribution of extra coded 
transmissions in a subsequent round (due to erasures), and 3) average packet decoding delay. This comparative 
study enables application-aware adaptive selection between IDNC and RLNC after systematic transmission phase. 

One contribution of this paper is to provide a deep and systematic understanding of the IDNC scheme, to 
propose the notion of packet diversity and an optimal IDNC encoding scheme for minimizing metric 1. This is 
generally NP-hard, but nevertheless required for characterizing and deriving all the three metrics. Analytical and 
numerical results show that there is no clear winner between RLNC and IDNC if one is concerned with both 
throughput and decoding delay performance. IDNC is more preferable than RLNC when the number of receivers 
is smaller than packet block size, and the case reverses when the number of receivers is much greater than the 
packet block size. In the middle regime, the choice can depend on the application and a specific instance of the 
problem. 

Index Terms 

Packet broadcast, network coding, minimum clique cover, throughput, delay 
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I. Introduction 

Before the advent of network coding [1], different information flows incoming to a node in a data network were 
forwarded separately to other nodes in the network. By moving away from this traditional routing/scheduling 
approach and introducing the concept of information mixing, it was shown that network coding can achieve the 
information capacity of wired networks with error-free links under a single-source multicast scenario [1], [2]. 
Network coding can enhance the throughput performance of many other wired and wireless data networks in 
more general multicast or non-multicast settings and with error-free or error-prone communication links [2]- 
[6]. However, any statement about throughput optimality of network coding should be treated with care from 
a practical perspective. Roughly speaking, a higher data throughput using network coding does not necessarily 
translate into faster delivery of information to the application layer because the mixed information flows need to 
be "untangled" or network decoded first [7], [8]. 

Understanding the tension between throughput and delay in network coded systems has been the subject of 
research in recent years [7]-[27], where a packet-level network coding model is particularly suitable for such 
studies [8]. However, there are still many open questions to be answered and we will highlight some of them in 
the following. 

Random linear network coding (RLNC) [28] has been one of the most successful and practical network coding 
schemes proposed in the literature. The encoding is very easy where a block or generation of Kt data packets 
are combined using random coefficients from a finite field. In a rateless wireless broadcast scenario with packet 
erasures, RLNC also requires minimum amount of feedback from the receivers, as they only need to acknowledge 
successful reception of Kt linearly independent coded packets [8]. It has been shown in [8] that rateless RLNC 
is throughput optimal 1 and can significantly outperform traditional scheduling approaches in terms of block 
completion time and hence, bandwidth or throughput efficiency. Nevertheless the fundamental tradeoff between 
throughput and delay still exists. A larger block size can enhance throughput and the broadcast network capacity 
can be arbitrarily closely approached [20]. But this comes at the cost of larger delays since packet delivery to 
the application layer cannot occur until block-wise network decoding is complete [7], [8], [20]. 2 

This has motivated researchers to look for other network coding schemes to improve packet decoding or 

'For large field sizes, we can assume that every encoded packet is linearly independent of all previously received k < Kt packets [28]. 
2 For large field sizes, we can ignore the probability of intermediate decoding in RLNC. 
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delivery delay. However, improved delay often comes at the cost of more frequent feedback and more complex 
decision making for packet encoding. For example, [14], [15] proposed network coding schemes for erasure- 
prone wireless broadcast, which maintain throughput optimality and use receivers' online feedback about every 
packet reception to dynamically select appropriate encoded packets for the next transmission. It was conjectured 
(and shown through numerical simulations) that the delay asymptotically follows that of a single-receiver case 
when the transmission rate approaches erasure link capacity. However, general non-asymptotic characterization 
of the delay performance of [14], [15] has proven to be challenging [12]. Only recently analytical results were 
presented on one of the three conditions for decoding [26], which were then used in [27] to propose a dynamic 
rate adaptation scheme. It was shown that by occasional carefully-balanced sacrifices in throughput, an overall 
better throughput-delay curve compared to [15] can be achieved. 

Another class of online network coding schemes which attempt to strike a balance between throughput and 
delay is instantly decodable network coding (IDNC) for erasure-prone wireless broadcast [9], [11], [12], [17]— [19], 
[22]. In a general case of IDNC [18], [19] and for a given receiver, coded packets either 1) contain one new data 
packet and hence are instantly decodable, 2) do not contain any new data packets and hence are non-innovative, 
or 3) contain more than one new data packet and hence, are deemed unusable. 

One advantage of IDNC is simple XOR-based encoding and decoding. Another important advantage is that 
instantly decoded packets can be potentially delivered to the application layer much faster than the RLNC scheme 
and those discussed in [14], [15], especially if packets are usable regardless of their order [11], [17]. However, 
similar to [14], [15] IDNC requires full use of feedback for making online network coding decisions. In addition, 
IDNC scheme is generally not throughput optimal (there exist situations where a subset of receivers receive 
either non-innovative or unusable packets). Any attempt to provably minimize either IDNC block completion or 
decoding delay, even in the absence of packet erasures, results in NP-hard problems [11], [17]— [19], [22], [29]. 
Consequently, heuristic or stochastic approaches, corroborated with simulation, are often used in the literature to 
enhance and verify IDNC throughput or delay performance [9], [11], [12], [17]— [19], [22], [29]. It is fair to say 
that theoretical interaction between throughput and delay of IDNC is not very well understood, mainly due to 
the complicated nature of IDNC decision making process, especially with packet erasures. 



II. Problem Formulation and Summary of Contributions 

In summary, RLNC is throughput optimal, but can incur large block decoding delays. Whereas, IDNC is not 
necessarily throughput optimal, but its immediate decoding property can be more beneficial for intermediate packet 
delivery to the application layer. The main open question is then how the throughput and delay performance of 
these two schemes compare. In this paper, we aim to provide a unified theoretical framework for quantitative 
comparison between throughput and delay performance of RLNC and IDNC schemes. To this end and to enable 
a meaningful comparison, we consider a transmission scenario which is briefly discussed and justified here. More 
detailed system model and notations will be provided in Section III. 

1) There are Kt packets to be broadcast to a set of N wireless receivers where each receiver link is subject to 
independent packet erasures. Transmission is rateless, meaning that no new packet is sent until all receivers 
have received all Kt packets. 

2) In the first phase of transmission, Kt uncoded packets are broadcast, which is known as the systematic 
phase of network coding [16], [23]. In order to perform IDNC in subsequent transmission phases, systematic 
transmission is essential in the first phase [17], [18]. This is, however, a slight variation of traditional RLNC 
scheme. Nevertheless, it does not result in loss of throughput optimality in RLNC and has some additional 
advantages [16], [23]. For example, the receivers can pass some of the uncoded received packets in a 
meaningful way to the application layer before the whole block of Kt packets is successfully received. 

3) After the systematic transmission phase, each receiver provides feedback to the sender about the packets it 
has successfully received or lost. Assuming that there still exist some missing packets across receivers and 
given the feedback from each receiver, the following key metrics about the throughput and decoding delay 
performance of RLNC and IDNC are determined at the sender. 

a) Metric 1: The absolute minimum number of coded transmissions that is needed to satisfy the demands of all 
receivers in RLNC and IDNC schemes in the best case scenario where there is no future packet erasures. 
This is denoted by ?7rlnc an d £^idnc> respectively. While C/rlnc is simply equal to the maximum number 
of lost packets across all receivers, finding C/idnc is far fr° m trivial. At this stage, we can only assert 
that ?7idnc ^ ^rlnc due to the fact that IDNC is not necessarily throughput optimal. However, precise 
answer to this question cannot be determined analytically and is even hard to solve algorithmically for 
large Kt, since it requires solving an NP-hard problem [29]. 
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b) Metric 2: The probability that there is still a need for more transmissions due to packet erasures. More 
specifically, the probabilities of still needing ^ Vrlnc ^ £^rlnc an d ^ VJdnc ^ ^idnc extra trans- 
missions in each scheme (after the minimum of C/rxnc or ^idnc coded packets are already transmitted 
by the sender) are analytically computed. 

c) Metric 3: The decoding performance of RLNC and IDNC schemes. More specifically, at the u-th coded 
transmission after the systematic transmission phase (u ^ C/rlnc or u ^ f7iDNc)> the probability of 
decoding packets at each receiver is analytically derived, from which the average packet decoding delay 
across all the receivers is computed. 

4) Based on the derived metrics, either RLNC or IDNC scheme is adopted by the sender for subsequent 
coded packet transmissions. Packets are RLNC or IDNC encoded and transmitted until all Kt packets are 
successfully received by all the receivers. 

A few notes are in order about the use of feedback and predictive performance metrics in our problem 
formulation. First, feedback is required at the end of systematic transmission phase. This is not part of traditional 
RLNC. However, it enables quantitative comparison of RLNC and IDNC schemes fairly (given the same packet 
reception/loss status at the receivers) and choosing either scheme for the subsequent coded transmission phase. 
Second, metrics 2 and 3 are computed assuming that no intermediate feedback is available at the sender before the 
minimum of ?7rxnc or ^idnc coded packets are transmitted. In addition to being practically attractive for IDNC, 
this less frequent semi-online feedback enables predictive analytical comparison of RLNC and IDNC schemes. 
In Section IV we will prove that fundamentally, metric 1 cannot be reduced regardless of feedback frequency or 
packet erasures. However, we will discuss that the overall throughput of IDNC can be enhanced if fully-online 
feedback is available after each erasure-prone packet transmission. As a result, our derived performance metrics 
can be interpreted as lower bounds to what can be achieved in fully-online IDNC. This would actually mean a 
smaller throughput gap between RLNC and fully-online IDNC, as well as a better decoding delay performance 
gain of fully-online IDNC over RLNC. 

To the best of our knowledge, such comparative framework for RLNC and IDNC schemes has not been 
studied before. Perhaps, the closest work to ours is [22] where the authors proposed a random graph approach to 
approximate metric 1 in IDNC, J7idnc> m multicast scenarios. In contrast, we are interested in ?7tdnc given the 
actual deterministic IDNC graph, obtained after the first systematic transmission phase. Section IV is devoted to 
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theoretical understanding of £/idnc> where we provide several definitions, proofs, and bounds on the relation 
between E/idnc an d the IDNC graph. We also propose an algorithm for determining C/idnc coded packet 
transmissions before semi-online feedback collection. This is needed later for computing metrics 2 and 3. While 
some similar (but not identical) concepts, bounds, and algorithms exist in the graph theory literature, our aim 
at providing a comprehensive and coherent treatment of C/idnc in the context of network coding is novel and 
useful. For example, we introduce concepts such as packet diversity, which can crucially affect IDNC robustness 
to erasures and its throughput and decoding delay performance. We also discuss the impact of feedback frequency 
on IDNC throughput efficiency. Finally, we highlight that both metrics 2 and 3 are new and were not studied 
in [22]. They will be treated in Section V and Section VI, respectively. 

III. System Model and Notations 

A. Transmission Setup 

We consider a packet-based wireless broadcast scenario from one sender to N receivers. Receiver n is denoted 
by R n and the set of all receivers is 1Z = {R\, ■ ■ ■ ,Rn}- There are a total of Kt packets to be delivered to all 
receivers. Packet k is denoted by Pj, and the set of all packets is Vk t = {Pi, ■ ■ ■ ,Pk t }- Sometimes we will 
refer to P^ as an original data packet to distinguish it from a coded packet. Time is slotted and in each time slot, 
one (coded or original data) packet is broadcast. The wireless channel between the sender and each receiver is 
modeled as a memoryless erasure link where the erasure probability for receiver n is P e ,n- We further assume that 
the erasures across receivers are independent of each other. For simplifying the theoretical analysis later in the 
paper, we will impose the additional condition of equal erasure probability among links. That is, P e n — P e for 
n = 1 , • • • , N. However, the theoretical framework proposed in the paper can be generalized, with appropriate 
modifications, to non-homogeneous erasure links. 

B. Transmission Phases and Rounds 

The minimum number of packet transmissions to satisfy the demands of all receivers is Kt and there is no 
need to employ any type of coding for the first phase comprising Kt transmissions. This is the systematic phase 
of network coding. 

After the first Kt systematic packet transmissions, each receiver provides feedback to the sender about the 



packets they have received or lost. 3 If a packet is successfully received by all receivers, it can be removed from 
any future coded packet transmission. Therefore, in the coded transmission phase we will deal with K < Kt 
packets that are wanted by at least one receiver and the set of all such packets is Vk = {Pi,-" , Pk}- The sender 
will then calculate some performance metrics and determine to use IDNC or RLNC for the subsequent phase of 
coded packet transmission until all receivers have received all packets. We further split the coded transmission 
phase into rounds. In the first round, the minimum of C/tdnc or ^rlnc coded packets are transmitted, which 
are needed in each scheme in the best case scenario of no future packet erasures. However, if due to erasures, 
some packets are still missing at some receivers at the end of the first round, at least one more round with 
Vidnc ^ ^idnc or Vrlnc ^ ^rlnc coded transmissions will be required. 

C. Receiver Demands and State Feedback Matrix 

After the systematic transmission phase, the complete state of receivers and packets can be captured by an 
N x K state feedback matrix (SFM) A (also known as receiver-packet incidence matrix [17]), where the element 
at row n and column k is denoted by a n ^ and 



a n ,k — < 



1 if R n wants Pk, 

(1) 



otherwise. 

Based on the SFM, we can define the following notions of Wants set for each receiver and Targeted receivers 
for each packet. 

Definition 1. The Wants set of receiver R n , denoted by W n , is the subset of packets in Vk which are lost due 
to erasures during systematic network coding phase and are missing at R n . That is, W n = {Pk '■ &n,k = !}• The 
size of W n is denoted by W n . 

Definition 2. The Target set of a packet Pk, denoted by %, is the subset of receivers in 1Z who want packet Pk- 
That is, Tk = {R n '■ cin,k = !}• The size of Tk is denoted by Tk- 

The size of the smallest and largest Wants sets among all receivers are denoted by W m - m and W max , respectively, 

which will be key parameters in determining the system performance. 

3 We assume that there exists an error-free feedback link between each receiver to the sender that can be used with appropriate frequency. 



8 



D. Coded Transmissions Using RLNC 

In the case of RLNC, all the packets in Vk (that by definition are wanted by at least one receiver) are linearly 
combined with each other at the sender using randomly chosen coefficients from a finite field ¥ q of size q and 
then broadcast to all receivers. The r-th coded packet is 



where a^^ G ¥ q and the coding coefficient vector for the r-th coded packet is a r = (a± ir , ■ ■ ■ ,a>K,r) T and the 
superscript T denotes vector transpose. Upon successful reception, these linearly coded packets along with their 
coding coefficient vectors are stored by each receiver. 

Definition 3. The knowledge space of each receiver after r successful coded receptions is defined as the linear 
space spanned by the coding coefficient vectors ol\ to a r . 

If the field size q is sufficiently large, we can comfortably assume that each successful coded packet reception 
will be innovative to a receiver who has not decoded all packets yet and hence, will increase the dimension of 
its knowledge space by one [28]. The minimum number of coded packet transmissions for RLNC, £/rxnc> is 
equal to the size of the largest Wants set across all receivers, W max . 

E. Coded Transmissions Using IDNC 

Determining suitable packets to code together that satisfy the IDNC constraints is much more complicated 
than RLNC. In this subsection, we first present some basic definitions related to IDNC. Then we will briefly 
discuss existing models in the literature to deal with the IDNC problem. Later in Section IV, we will provide a 
deeper understanding of the IDNC and its minimum number of coded transmissions C/idnc> which is one of the 
contributions of this paper. 

Definition 4. A coded packet is instantly decodable for receiver R n if it contains only one original data packet 
that R n has not decoded yet. 

Definition 5. A coded packet is non-instantly decodable for receiver R n if it includes a linear combination of 
two or more original data packets that R n has not decoded yet. 

Definition 6. A coded packet is non-innovative for Rn if it only contains original data packets that Rn has 
already decoded. Otherwise, the coded packet is innovative. 



K 




(2) 



fc=i 
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It suffices to use the binary field F2 for IDNC coded packet transmissions. More precisely, the r-th transmitted 
coded packet in the IDNC scheme is 



where G F2 and the coding coefficients vector is f3 r = (/3i, r ,- - - ,(^K,r) T with /3 r G {0,1}^. The IDNC 
problem is then to determine the "best" coding coefficients vector (3 r = (Pi, r , ■ ■■ , ^K,r) T for each time slot r. 
This has been recognized as a highly non-trivial and computationally complex problem [11], [17]— [19], [22], [29] 
and there have been various approaches in the literature for solving this problem, which we now briefly discuss. 

F. S-IDNC versus G-IDNC 

One main model to capture IDNC constraints and to determine coded packets is strict IDNC or S-IDNC [11], 
[17], [29]. Imposing the S-IDNC constraints means that for a given receiver R n , Y r can at most contain one 
packet from the Wants set of that receiver. Mathematically, either J2k-a n k =i Pk,r = 1 resulting in decoding one 
new packet at R n or Ylk-a k =i &k,r = which renders Y r non-innovative for R n . The S-IDNC constraints for 
all receivers can be concisely written using the SFM A as A(3 r ^ ljv, where ljy is an all-one N x 1 vector 
and the inequality means that every entry in A(3 r is smaller or equal to the corresponding entry in ljv- As 
has been mentioned in [29], the S-IDNC constraints imposed by the SFM, A, can also be represented using 
an undirected graph with K vertices corresponding to K wanted packets. In Section IV, we will examine the 
graphical representation of S-IDNC in detail and provide new insights into the S-IDNC problem. 

S-IDNC is a strong type of IDNC. Intuitively, its main limitation is that some coding opportunities benefiting 
a large number of receivers may be lost just because a small number of receivers cannot instantly decode those 
coded packets. This can potentially reduce the throughput. In contrast, G-IDNC proposed in [18], [19] relaxes 
the problem to a certain extent with the aim to improve throughout and decoding delay (compared to [17]), while 
still benefiting from simple XOR-based decoding at the receivers. The sender may send packets that are instantly 
decodable only for an appropriately chosen subset of receivers. If a receiver receives a coded packet which is 
non-instantly decodable to it, it discards that packet. In other words, the sender is not restricted to send fully 
IDNC packets for every receiver, but the receivers adhere to IDNC decoding principle. 

Obtaining the corresponding G-IDNC graph from the SFM A is detailed in [18], [19], [22], where the authors 



K 




(3) 



k=i 
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use packet (fully-online) feedback to update the graph after each transmission and select a maximal clique^ 
through a heuristic maximum weighted vertex search. However, the size of G-IDNC graph is in general 0(NK) 
as opposed to the size of S-IDNC graph, which is K. Hence, finding the minimum number of coded transmissions 
for G-IDNC (even with no erasures) is much more computationally expensive than S-IDNC. This is why a random 
graph approach has been adopted in [22] to adaptively select between RLNC and G-IDNC in multicast scenarios. 

In the rest of this paper, our aim will be to better understand and characterize the S-IDNC problem and compare 
it with RLNC from a theoretical viewpoint. We will simply refer to S-IDNC as IDNC. Based on the reported 
results in [18], [19], we expect that the throughput performance of G-IDNC is superior to S-IDNC. Hence, the 
study of this paper can serve as worst-case throughput gap between G-IDNC and RLNC, as well as worst-case 
decoding delay performance gain of G-IDNC over RLNC. 

IV. Understanding IDNC 

As has been mentioned in the last section, IDNC constraints of an SFM A can be represented by an undirected 
graph Q(V,£) with K vertices. Each vertex v i G V represents a wanted original packet P{ and two vertices Vi 
and Vj are connected by an edge ejj 6 £ if P{ and Pj are not jointly wanted by any receiver [29]. This graph 
model, however, has only been employed to heuristically find IDNC encoding solutions in the literature [29]. 
In this section, we will revisit this graph model and its equivalent matrix/set representation. We provide several 
definitions that facilitate our analysis of the IDNC performance in this and the next two sections. We then prove 
the relation between the minimum number of coded transmissions, C/tdnc> an d the IDNC graph and discuss the 
effect of feedback frequency on IDNC throughput. We also present two simple bounds on C/tdnc- Finally, we 
propose an IDNC encoding algorithm which is optimum in terms of C/idnc- Although some similar concepts and 
results exist in the graph theory literature, their compilation, presentation and more importantly interpretation in 
the IDNC context is new, to the best of our knowledge. We will highlight the similarities, differences, and new 
results as appropriate. 

4 In an undirected graph, all vertices in a clique are connected to each other with an edge and a clique is maximal if no other vertex 
can be added to it to form a larger clique. 
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A. IDNC Modeling 

In this subsection, we construct the matrix/set model of IDNC and demonstrate its relationship with the graph 
model. The construction of the matrix/set model is based on the concept of conflicting packets, which is defined 
below. 

Definition 7. We say that two packets Pi and Pj conflict with each other if there exists at least one receiver who 
wants both packets. That is, Pi and Pj conflict if both belong to the Wants set, W n , of at least one receiver such as 
R n . Mathematically, we can denote a conflict between Pi and Pj by PiQ)Pj, where Pi&Pj 3n : {Pi, Pj} C W n . 

It is clear that based on the requirement of IDNC, two conflicting packets Pi and Pj cannot be encoded 
together. The equivalent of such conflict in the graph model is the absence of an edge between the two vertices 
[29], [30]. 

Definition 8. We say that two packets Pi and Pj do not conflict with each other if there is no receiver who 
wants both packets. Mathematically, we can denote a lack of conflict between Pi and Pj by Pi&Pj, where 
Pi0Pj^$n:{Pi,Pj}cW n . 

The equivalent of two non-conflicting packets in the graph model is an edge between the two vertices [29], 
[30]. 

Based on Definition 7 and 8, we can define a triangular conflict matrix C of size K(K — l)/2 that can fully 
describe the conflict state of packets based on the SFM A and is given below. 

Definition 9. A fully-square conflict matrix of size K 2 is a binary-valued matrix with element at row i and column 
j denoted by Cij corresponding to the conflict state of packets Pi and Pj. In particular, o h j = 1 if Pi® Pj and 
Ci d = if Pi0Pj. 

Due to the symmetry of conflict between packets and noting that Ci^ = 0, VPj, we can reduce the fully-square 
conflict matrix to a triangular matrix C of size K(K — l)/2. From now on, by conflict matrix, we mean the 
reduced triangular matrix C. 
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Example 1. An example of a conflict matrix is given below. 
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Note that in dealing with the conflict matrix, we are not concerned with the receivers who may need a certain 
packet. In fact, it is not difficult to show that two or more SFMs can have the same conflict matrix. Unless 
otherwise stated, it suffices to deal with conflict matrix C, instead of SFM A. 

We now define the key concept of maximal encoding sets that are allowed for IDNC transmission. 

Definition 10. A maximal encoding set, M., is a collection of original packets which simultaneously hold the 
following two properties: 1 ) their coded transmission do not violate the instant decodability constraint for any 
receiver R n . That is, Ylp-eM ^ !J e ^ tner instantly decodable or non-innovative for every receiver R n £ 1Z, where 
the sum is over F2; 2) addition of any other packet from Vk \M. to M. will result in the violation of instantly 
decodable property of 2~2p eM f or at ^ east one rece ^ ver sucn as Pn £ 

Based on the definition of conflicting and non-conflicting packets, we can say that firstly none of the packets in 
a maximal encoding set M should conflict with each other. That is, Vi, j : {Pi, Pj} C M =>• Pi0Pj. Furthermore, 
addition of any packet such as from Vk\M. to M will result in conflict(s) with its existing member(s). That 
is, VPfc 6 Vk \ M., 3Pj G M : Pk Pj. The equivalent of maximal encoding sets in the IDNC graph model is 
known as maximal cliques [30]. 

In Example 1, it can be easily verified that {Pi, P2, P3}, {Pi, Pa}, {^2,^5} and {P^,Pq} are all maximal 
encoding sets. 

For reasons that become clear at the end of this subsection, we ensure that in each IDNC coded packet 
transmission, the sender will encode all and not a subset of the original data packets in a maximal encoding 
set. To satisfy the demand of all the receivers, the sender has to transmit encoded packets from an appropriately 
chosen collection of maximal encoding sets. To achieve this, each original data packet should appear at least 
once in this collection. This condition can be formally represented by the diversity constraint, where: 
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Definition 11. The diversity of a packet Pi within a collection of maximal encoding sets is denoted by d% and is 
the number of maximal encoding sets in which it appears. 

Definition 12. A collection of maximal encoding sets satisfies the diversity constraint iff every original packet 
has a diversity of at least one within this collection. 

For example, given three maximal encoding sets: Mi = {P\, P2, P%}, M2 = {Pi, Pa}, M3 = {P2,P§}, we 
have d\ = d2 = 2, d% = d^ = d$ = 1. 

Given all the maximal encoding sets of a conflict matrix C, there exists at least one collection which satisfies 
the diversity constraint (in the extreme case all the maximal encoding sets include all the original packets). The 
size of the collection is the number of maximal encoding sets in it. We then define the minimal collection and 
its size as follows: 

Definition 13. A collection of maximal encoding sets is minimal if there does not exist any collection which 
satisfies the diversity constraint with a smaller size. The size of the minimal collection is called the minimum 
collection size. This number, as we will prove in the next subsection, is exactly the absolute minimum number of 
coded transmissions, E/tdnc- 

We denote a minimal collection by S = {Mi, ■ ■ ■ ,Mu IDNC }- A problem in the graph theory which is similar 
to finding a minimal collection of maximal encoding sets is the "minimum clique cover" problem [29], [30]. 
These two problems differ somewhat because cliques do not overlap in the minimum clique cover problem. That 
is, the cliques are not necessarily maximal and each vertex appears in only one clique. This would be equivalent 
to choosing a minimal collection of encoding sets in our IDNC solution where all packets have a diversity equal to 
1. This would not change C/tdnc- However, it can have a serious adverse impact on IDNC robustness to erasures, 
which in turn degrades the IDNC overall throughput and decoding delay performance. Consequently, it is desirable 
to choose a minimal collection of maximal encoding sets that, while satisfying minimum E/tdnc> provides as 
much packet diversity as possible. This will be further clarified in Section IV-D, Section V and Section VI. 

B. The Equivalence between C/idnc and the Minimum Collection Size 

In this subsection, we prove that the absolute minimum number of coded transmissions, C/idncs equals the 
minimum collection size of the IDNC conflict matrix. We will also comment on the role of feedback frequency 
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on the number of coded transmissions in IDNC. 

For a given conflict matrix C, proving the equivalence between the absolute minimum number of coded 
transmissions and the minimum collection size is equivalent to proving the following theorem: 

Theorem 1. Upon successful reception of a maximal encoding set M. of C by all the target receivers of M, the 
minimum collection size of the updated conflict matrix C is at least ?7idnc — 1- 

This theorem holds if the following two theorems hold: 

Theorem 2. The minimum collection size of a conflict matrix C with a graph model Q equals the chromatic 
number of the complementary graph of Q, denoted by Q, whose vertex connectivity is opposite to Q. 

Theorem 3. Suppose M. is a maximal clique of Q and the chromatic number of Q is U\ymc- By removing the 
vertices in M. from Q we obtain an updated graph Q'. The chromatic number of Q' is at least ?7tdnc — 1- 

The Q' here is indeed the graph model of the updated C because if M has been successfully received by all 
its targeted receivers, original packets in M are no longer wanted. That is, the corresponding vertices can be 
removed from Q. The proofs of Theorem 2 and 3 are provided in Appendices A-B. 

Remark 1. Greedy/heuristic algorithms cannot guarantee C/idnc- ^ n me proof of Theorem 3 we mentioned that 
if a maximal encoding set Ai, which does not belong to any minimal collection of the conflict matrix C is 
successfully received by its target receivers, the chromatic number of the updated Q' is still JTidnc- This means 
that the updated matrix C still needs [/tdnc transmissions. For example, consider the maximal encoding sets 
in Example 1. A suboptimal IDNC algorithm might choose {P\^P2 : Pz}, which does not belong to the only 
minimal collection S = {{-Pi, P4}, {P2, -P5}, {P3, P&}}- Even if this set is successfully received, three further 
transmissions are needed for P4, P5 and Pq. In total there will be 4 transmissions, greater than C/idnc = 3. 

Assume that the sender chooses IDNC over RLNC scheme and finds an appropriate S. Clearly, the order 
of transmission of maximal encoding sets within S does not alter £/idnc> but it can affect the decoding delay 
performance of IDNC and will be discussed in Example 3 and Section VI. In what follows, we assume that the 
maximal encoding sets within S are appropriately ordered. Then all packets in set Mi are added together in ¥2 
and transmitted in the first coded transmission. The process is repeated until all packets in Mu IDNC are encoded 
and transmitted. 
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According to the above theorems, collecting fully online feedback during C/idnc transmissions cannot reduce 
the total number of coded transmissions below ?7tdnc> even in the best case scenario of erasure-free packet 
reception. Hence, as a variation of existing IDNC schemes in the literature, one can reduce feedback frequency 
to semi-online feedback, where the SFM A is updated in rounds. For example, after ?7idnc encoded packets 
from a selected minimal collection S have been transmitted and so on. In fact, the systematic phase of IDNC is a 
special case in this scheme: here the SFM A is a Kt x Kt all-one matrix, which requires that all the Kt original 
packets should be transmitted separately in C/idnc = Kt transmissions before the sender collects feedback. 

In addition to making the throughput and delay analysis of IDNC tractable, a lower feedback frequency can be 
advantageous in practical implementation of IDNC where the use of reverse link is costly and involves transmission 
of some control overheads. Another practical attraction is that it also avoids solving the computationally expensive 
IDNC encoding problem in every time slot. However, this comes at the potential cost of overall throughput 
inefficiency in semi-online IDNC, as we explain below. 

Now imagine an IDNC scheme in the presence of erasures. In the semi-online feedback case, the sender sends 
the entire maximal encoding sets in S before collecting feedback and solving the IDNC encoding problem again. 
Let us assume that in total, X IDNC transmission rounds are required with a total of U^ onYmc = £* =1 Uf DNC 
coded transmissions. In contrast, an optimal fully online IDNC scheme works as follows. It updates the SFM 
A after each coded transmission and constructs the corresponding conflict matrix C. It then finds C/idnc an d a 
minimal collection S for C. However, it only sends the encoded packets in M\ from S before collecting another 
feedback from the receivers and solving the IDNC encoding problem again. The sender repeats the process until 
all packets are received by all receivers. Let us denote the total number of coded transmissions in this case by 
^i°dnc • < - )ne can fi 11 ^ examples where U°$$q < ^idn'c onlme , where one such example is provided in Appendix 
C. However, characterizing performance differences between fully online and semi-online IDNC is still an open 
question and beyond the scope of this paper. Intuitively, we expect the difference to be small when packet erasure 
probability is low. In any case, our throughput and delay analysis of IDNC scheme based on semi-online feedback 
serves as a worst-case scenario for an optimal fully-online IDNC with packet erasures. 
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C. Two bounds on C/idnc 

Here, we present an upper bound and a lower bound on C/idnc > which only depend on the number of zeros 
in the conflict matrix C. These bounds are important references for practical/heuristic IDNC encoding algorithm 
design: any algorithm offering C/tdnc above the upper bound or below the lower bound is throughput inefficient 
or non-instantly decodable, respectively. 

For any N x K SFM A, its conflict matrix C fully represents the corresponding IDNC encoding problem. 
Thus C/idnc on ly depends on the number (denoted by Mo) and positions of zeros in C. Although determining 
C/idnc f or a C with an arbitrary positioning of Mo zeros is NP-hard, by deliberately positioning these zeros we 
are able to find out the minimum and maximum possible C/idnc- Both the upper- and lower-bound depend only 
on K and Mq and are denoted by C/idnc (-K', Mq) + and C/idnc (-^ Md) ~, respectively. 

• U 1DNC (K,M )+ 

The general intuition here is trying our best to waste the encoding opportunities brought by the Mq zeros. 
We first note that for any given packet, there are K — 1 entries in C about the conflict of that packet 
with all other packets. When Mo = 0, there is no encoding opportunities, so C/idnc {K, Mq = 0) + = K. 
When Mo belongs to the closed interval [1,K — 1], we can assign all zeros to the entries about the same 
packet, say Pi, to increase its encoding opportunities. But C/tdnc remains K — 1 because P2, ■ ■ ■ , Pr have 
to be transmitted separately. After K — 1 zeros have been exhausted, there are K — 2 entries in C about 
every packet other than P\. Thus when Mo belongs to the closed interval [K, (K — 1) + (K — 2)], we can 
assign these extra K — 2 zeros to the entries about the same packet, say P2, and C/idnc remains K — 2 
because P3, • • • , Pr have to be transmitted separately. This iterative process indicates that C/idnc (K, Mq) + 
decreases in a staircase way with Mo. The relationship can be expressed as: 

M = 

M G [1,K-1] 

M G [K, 2K - 3] W 
M = K(K-\)/2 

and is shown in Fig. 1. This bound can also be derived from an upper-bound on the chromatic number of 



K-h 



U mNC (K,M ) + = < k_ 2 , 
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the corresponding complementary graph Q: 



x{Q) < HQ) 



(5) 



where A(Q) is the maximum degree of Q. Suppose every original packet Pj has <5j nonzero entries in C 
about it, then A(Q) = maxj(<5j). 



The intuition here is making the best of the encoding opportunities brought by the Mq zeros. In other words, 
we should use as few zeros as possible to reduce J/idnc by one. When Mo = 0, no packets can be encoded 
together. Thus S = {{-Pi}, {P 2 }, ■■■ , {Pk}} and C/idnc = K- We then reduce C/tdnc iteratively. In each 
iteration, C/tdnc can be reduced by 1, i.e., the size of S can be reduced by 1, if we can merge two maximal 
encoding sets in S together. In order to use as few zeros as possible, we always pick two sets with the 
smallest sizes (say m and n) to merge, which requires mn zeros to resolve the conflicts between the packets 
in them. Hence, in each iteration, it is impossible to reduce C/tdnc by 1 until mn new zeros are added to 
C. This iterative process provides a lower-bound of C/"tdnc(^, Mq). Below is an example with K = 5: 



In the first iteration, we can reduce the size of S by 1 by merging {P\} and {P2} together, which requires 
one zero, as in Eq. (6a). In the second iteration, we can reduce the size of S by one by merging {P3} and 
{P4} together, which requires 1 zero, as in (6b). In the third iteration, 2x1 = 2 zeros are needed to merge 
the two smallest maximal encoding sets {P±, P2} and {P5} together because we have to resolve the conflicts 
between P\ and P5 and between P2 and P5. In the last iteration, 3x2 = 6 zeros are needed to merge 
{Pi, P2, -P5} and {P3,Pi} together. After that, all the 10 entries in C become zeros and C/idnc becomes 
1. 



• ?7idnc(^,Mo)- 



Example 2. 



S 



{{Pi}, {P 2 }, {P 3 }, {P 4 }, {P 5 }} t/lDNC = 5 



s 



{{Pi, P 2 }, {P 3 }, {P 4 }, {P 5 }} 1x1 = 1 zero needed, U lDNC = 4 



(6a) 



S 



{{Pi, P 2 }, {P 3 , P 4 }, {P 5 }} 1x1 = 1 zero needed, U 1DNC = 3 



(6b) 



S 



{{Pi, P 2 , P 5 }, {P 3 , P 4 }} 2x1 = 2 zeros needed, U WNC = 2 



(6c) 



S 



{{Pi, P 2 , P 3 , Pa, P 5 }} 2x3 = 6 zeros needed, C/idnc = 1 



(6d) 
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Similar to the upper-bound, the lower-bound also decreases in a staircase way with Mq. In this example: 

5, M = 
4, M = l 

^IDNC(5,M ) _ = <j 3 Mq e [2,3] ( ? ) 
2, M g [4,9] 
1, M = 10 

This relationship can be approximated by a single formula which is derived by Geller in [31], [32] using a 
different approach: 



U mNC (K,M Q ) 



K 2 



(8) 



K + 2M 

where |~m] denotes the smallest integer greater than m and one can easily verify that our bound is tighter. 

Although the upper- and lower-bounds of [/tdnc(-^> Ma) can be determined, there does not exist a simple 
theoretical approach to find out the distribution of Um^c{K, Mq) between the bounds. The best we can do is to 
run numerical simulations to obtain the average of [/tdnc(-^> Md). Fig. 1 demonstrates the bounds and numerical 
average of C/tdnc(^ Md) with K = 20 wanted original packets. For each value of Mo, random permutations of 
Mo zeros and K(K — l)/2 — Mo ones are assigned to the entries of C. Optimum IDNC algorithm which will 
be proposed in the next subsection is then performed to determine the C/idnc f° r this C. 

While the upper-bound decreases gradually with Mo, the lower-bound falls down very quickly at first. With as 
few as 30 zero entries, it is possible to reduce ?7idnc from 20 to only 5. After that, the lower-bound decreases 
much more slowly. It requires another 16 zero entries to reduce the lower-bound from 5 to 4 and from 4 to 
3. This number is 32 for reduction of [7idnc(^i Md)~ from 3 to 2, and is as large as 96 for its reduction 
from 2 to 1. The largest gap between the two bounds is 13, taking place when Mo = [30,37]. The average of 
^idnc(-^ = 20, Mo), without surprise, lies between the two bounds. It decreases smoothly with Mo and is closer 
to the lower-bound than to the upper-bound. 



D. Optimum IDNC Encoding Algorithm 

An optimum IDNC encoding algorithm should provide a solution which satisfies the demand of all the receivers 
with f/iDNC transmissions in the best case scenario with no packet erasures. In our model, the algorithm should 
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20 40 60 80 100 120 140 160 180 200 

Number of zeros (M ) 

Fig. 1: Theoretical bounds and numerical average of iVidnc against Mo with K = 20. 

find minimal collections of the conflict matrix C, which is similar to the minimum clique cover problem in 
graph theory and is generally NP-hard [29], [33]. In [29], the authors proposed an optimum algorithm based 
on exhaustive search: all the 2 K possible encoding sets (whether instantly decodable or not) were generated. 
Then all the size-1, 2, • • • collections of these sets were examined to see if any of them satisfy both the IDNC 
constraints and the diversity constraint. The process ended after all the size-C/iDNC collections of these sets were 
examined. However, this algorithm is highly computationally expensive even with a medium-size Vk [29] . In this 
subsection, we propose to partition this task into two stages to reduce the computational load. Further reduction 
can be achieved if the second stage is replaced by a heuristic algorithm. The two-stage process is as follows: 

1) Finding all the maximal cliques: An efficient algorithm is Bron-Kerbosch algorithm [34] which finds all 
the maximal cliques of an undirected graph in a recursive way. We use this algorithm to obtain all the 
maximal encoding sets. The group of all maximal encoding sets is denoted by A. 

2) Finding minimal collections from A: There is no polynomial time algorithm for this task. We propose an 
iterative algorithm which is much faster than exhaustive search. The first intuition here is that if a packet 
belongs to only one of the maximal encoding sets in A, this set has to be transmitted. A more general 
intuition is that if a packet belongs to d maximal encoding sets, one of these d sets has to be transmitted. 

Remark 2. After obtaining all the maximal encoding sets in the first stage, a simple heuristic algorithm can be 
employed to find a collection instead of Algorithm 1. For example, in a greedy algorithm, we first choose the 
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Algorithm 1 Finding minimal collections from A 
1: Generate a group B of collections which contains only an empty collection S, and generate an iteration 

counter u = 1. 

2: while None of S in B satisfies the diversity constraint do 

3: while Not all collections in the group B have a size of u do 

4: Pick a collection in B with size u — 1, say 5 = {Mi, • • • , .M u _i}. Denote the original packets within 
S by V u -i = UrX esi-Mi} and all the remaining original packets by V u -\ = Vk\V u -i- Also denote 
the group of maximal encoding sets of C other than Mi, ■ ■ ■ ,M u -i by S = A \ S. 

5: Pick a packet, say P, in V u -\ which has the smallest diversity d within S and find out all the maximal 
encoding sets comprising it, say M[, ■ ■ ■ ,M f d ; 

6: Branch S into d new collections, S[,--- ,S' d . Then add M' l} --- ,M' d to them respectively. These 
collections have a size of u and replace S in B. 

7: end while 

8: U = U + 1 

9: end while 

largest maximal encoding set. Then for the remaining packets not belonging to this set, we look for a set which 
comprises most of them. This iterative algorithm is much faster than Algorithm 1, but can be suboptimal in terms 
of iTiDNC- There are also other heuristic algorithms (based on graph model or not) in the literature, interested 
readers are referred to [29]. 

Remark 3. For a given SFM A we find all the minimal collections of its corresponding conflict matrix C 
through Algorithm 1. If there are more than one such collection, we can select the one having larger diversities 
for the packets wanted by more receivers to gain the highest system robustness to packet erasures. Our proposed 
selection criteria for a minimal collection S, which is a scalar value and denoted by a, is given by 

K 

a{S) = Y,d k T k (9) 
k=l 

where d^ is the diversity of within S and is the number of targeted receivers of Then the transmission 
order of the maximal encoding sets in this collection could be adjusted to benefit more receivers first. 
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In this paper, we will always use the above process to find the minimal collection of A which has the largest 
average diversity and has its maximal encoding sets reordered based on maximum decoding benefit to receivers. 
This minimal encoding set is the minimal collection of this A for transmission and used for throughput and delay 
analysis. Below is an example of the proposed two-stage IDNC encoding process: 

Example 3. 
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Given the SFM, A and its corresponding conflict matrix C, it can be easily verified that the maximal encoding 
sets are {P\,P%}, {P2, P3, P5}, {^3,^4}, {P^Pq} and {P§,Pq}. Then the algorithm will find the minimum 
collection(s) as follows. : 

• In the first iteration, P\ with d\ = 1 is picked, thus S = {{P\, P3}}. This collection does not satisfy the 
diversity constraint; 

• In the second iteration, the remaining packets are P2,Pa,P5 and Pq. Then P2 with c?2 = 1 is picked. Thus 
S = {{Pi, P3} , {P2, P3, P5}}. This collection still does not satisfy the diversity constraint; 

• In the third iteration, the remaining packets are P4 and Pq, with d± = d§ = 2. Then P4 is picked. Thus 
S x = {{Pi,P 3 }, {P2, P3, P5}, {P3, Pa}} and S 2 = {{Pi, P 3 }, {P2, P 3 , P5}, {Pa, A}}- Because Si does not 
satisfy the diversity constraint but S2 does, S2 is the minimal collection and Uibnc = 3. 

• Because there is only one minimal collection, no additional selection process is needed. 
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• We then can reorder S2 into S = {{Pi, P3}, {P4, P&}, {P2, P3, P5}} to benefit 5, 5, and 3 receivers in the 
first, second and third transmissions, respectively. 

We remark that other selection and reordering approaches can be easily adopted if higher priorities are issued 
to specific packets or receivers, which depends on the application. 

V. The Disadvantage of IDNC over RLNC: Throughput Analysis 

Since rateless transmission is assumed, system throughput is inversely proportional to the total number of 
transmissions required to successfully broadcast the current block of Kt original data packets to all the receivers. 
This number consists of three parts: 

1) In the systematic transmission phase, Kt uncoded original data packets are transmitted. This number is 
fixed regardless of whether we use RLNC or IDNC later. Therefore, we will not consider it in this section. 

2) In the first round of coded transmission phase, a minimum of C/idnc an d C/rlnc transmissions are required 
by IDNC and RLNC, respectively. In general, we have C/idnc ^ C/rlnc- m this section, we will investigate 
how the number of receivers affects the gap between C/idnc an d C/rlnc- 

3) After the first round of coded transmission, extra transmissions may be still needed due to erasures. We 
denote the number of extra transmissions in the second round by Vldnc ar >d Vrlnc f° r IDNC and RLNC, 
respectively. In this section, we will derive their probability distribution and verify the derivations through 
simulations. 

By combining two coded transmission rounds (that is by studying C/idnc ar >d ^idnc together and C/rlnc an d 
Vrlnc together), we can shed light on the relative throughput performance of IDNC and RLNC. 

A. Comparing C/idnc and C/rlnc 

Our objective here is to compare the mean of C/idnc an d the mean of C/rlnc under the same system parameters: 
total packet number Kt, link erasure probability P e , and number of receivers N. By changing N, we are able 
to observe how the number of receivers affects the gap between C/idnc an d C/rlnc- 

For RLNC, recall that the minimum of coded transmissions, C/rlnc . is equal to the maximum number of 
lost packets across receivers in the systematic transmission phase, W max . The relationship between the mean of 
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?7rlnc and N can be written as: 

Pr(U RLNC **W) = Pr(WW ^W)=(j2 ) P K l ~ Pef^ (10a) 

K T 

£[^rlnc] = E[W max ] = W{Pi(W max ^W)- Pr(VF max ^ W - 1)) (10b) 

where Pr(W max ^ W) is the cumulative distribution function (CDF) of W max with W G [0,Kr]. This equation 
indicates that C/rlnc increases quickly with N when iV is relatively smaller than Kt and increases gradually 
with N when TV is relatively greater than Kt, as demonstrated in Fig. 2, where Kt = 15 and P e = 0.2. 

For IDNC, we know that with increasing N, more original packets may be wanted after the systematic 
transmission phase and thus the size of the conflict matrix C will increase. Furthermore, every new receiver 
may incur new conflicts between packets and thus the number of zeros in C will decrease with N. Both of these 
two factors will increase L^dnc- However, there is no analytical expression for the relationship between the mean 
of fTrDNC an( l N- A s an alternative, we resort to numerical simulations. We randomly generate M samples of the 
SFM A according to the system parameters Kt, P e and N. Then we average the C/idnc across the simulated 
SFMs to obtain the average JTidnc- 

The numerical results with K T = 15, P e = 0.2, M = 10 5 and N G [1,45] are plotted in Fig. 2. Similar 
simulation is applied to RLNC and the average C/rlnc is also plotted in this figure. As expected, the simulated 
average matches very well the theoretical mean, derived in (10). Our first observation is that C/idnc increases 
quickly with N when N is smaller than Kt and then increases almost linearly with N when N is greater than 
Kt- The second observation is that C/idnc is almost the same as £/rlnc when N is smaller than Kt, but is 
greater than C/rlnc when N is greater than Kt- This gap grows gradually with N. 

Therefore, we conclude that IDNC is a good alternative to RLNC in terms of the absolute minimum number 
of coded transmissions when the number of receivers N is smaller than Kt, but is inefficient when N becomes 
larger. 

B. The Distribution of Vidnc and Vrlnc 
1) The Distribution o/ V[dnc-' 

For a given SFM A, we first consider the probability that a receiver R n still needs V n original packets after the 
first round of coded transmissions. This is denoted by Pr(V^), where V n ranges from (no more transmission 
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^Average of U RLNC 

Theoretical average of U RLNC 
from Eg. (11) 
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Fig. 2: Average C/idnc and E/rlnc versus N (Kt = 15, P e = 0.2). 
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needed) to W n (receiver R n did not receive any innovative packets during C/tdnc transmissions due to poor 
channel conditions). 

For obtaining Yr{V n ), we assume a specific minimal collection S or A with a size of C/tdnc was chosen for 
the first round of coded transmission, where dk is the diversity of P^ within this minimal collection. We denote 
the n-th row of the SFM A after the systematic phase by a n , which is the state feedback vector of R n . The 
number of ones in a n is represented by ||a n ||i> where || • ||i means 1-norm and we have ||a n ||i = W n . We further 
denote the n-th row of the updated A after the first round of coded transmission by a' n . It may take any form 
under the constraint that a' n ^ a n , i.e., every entry in a' n is smaller than or equal to the corresponding entry in 
a n . Then, the general expression of Pr(V^) can be written as: 



Pr(V n = V)= Yl II 

V< :\\a' \W=V \Vfe:P fc eW„ 



a' n (k) 



l-(Pe. 



l-a' n (k) 



(11) 



where a' n {k) is the /c-th entry of a' n . 

For example, suppose that K = 4 after the systematic phase, and R n still wants Pi, P3 and P4, i.e., a n = 
[10 11]. Suppose that R n still wants V n = 2 packets after the first round of coded transmission. Then there 
are three possibilities for a' n as follows: a' n = [001 1], a' n = [1001] and a' n = [1010]. The probability of 
a' n = [0 1 1] is the product of the probabilities of three independent events: Pi has been received by P n , but 

P3 and P4 have not. Their corresponding probabilities are 1 — P e dl , P^ 3 and P^ 4 , respectively. 5 By summing up 

5 For example, for Pi to be received by R n it suffices that the receiver did not experience erasures during at least one out of dk time 
slots where Pi was broadcast. Also, since a' n (k) = or a' n (k) = 1, the appropriate event probability in (11) is automatically chosen. 
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the probabilities of the three possible a' n , we obtain Pr(V„ = 2). 

Remark 4. From (11), the significance and impact of packet diversity on IDNC overall throughput should 
become apparent. The larger the packet diversity, the higher the probability that a receiver successfully receives 
its requested packet during the first C/idnc coded transmissions and hence, the lower probability that more 
transmissions are required in the second round. This is the main reason why the sender should send maximal 
encoding sets (rather than disjoint sets) in a minimal collection with the largest average diversity across all 
receivers as suggested by (9). 

By employing (11) we are able to derive the distribution of Pr(V[DNc) : 

• Having computed Pr(y n = 0) for a given R n , the probability that no receiver needs more transmissions 
beyond ?7tdnc> is simply 

N 

Pr(li DNC = 0) = J] Pr(K = 0) (12) 

7Z=1 

• Having computed Pr(V„ = 0) and Pr(V n = 1), the cumulative distribution function (CDF) of V n for R n is: 

Pr(K < 1) = Pr(K = 0) + Pr(K = 1) (13) 
Now given Pr(V n ^ 1) for all the receivers R n G 1Z, the CDF of Vednc f° r the system is: 

N 

Pr(lW < 1) = II Fl( - Vn < X ) ( 14 > 
n=l 

• Similarly, we can calculate the CDF value Pr(V n ^ 2) for a given R n . If every receiver needs at most two 
packets, the probability that the system needs at most two extra transmissions is upper bounded by 

N 

Pr(lW ^2)^n Pr(K s$ 2) (15) 

n=l 

The inequality comes from the fact that if more than 2 receivers need two packets, we might need 3 or more 
transmissions to accommodate this using IDNC. 

• Iteratively, we can compute Pr(T4 ^ 3) and so on up to Pr(V^ ^ W n ). Notice that ~Pr(V n ^ W n + 1) = 
• • • = Pr(y n ^ £/idnc) = 1- Then an upper-bound on the CDF, Pr(ViDNC ^ V), for any V 6 [3, f/iDNc] is 
obtained. 

Consequently, we estimate the PDF of VJdnc as: 

Pr(li DNC < V), V = 



Pr(Fi D NC = V) = { 



(16) 

Pr(li DNC <V)- Pr(ViDNC < V - 1), V £ [1, ?7idnc] 
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We note that Pr( Vidnc = V) for V ^ 3 is an approximation rather than a provable bound. In order to verify its 
accuracy, we run some simulations as follows: 

1) Randomly generate an SFM A and find out the minimal collection S of A which has the largest average 
diversity. 

2) Transmit the maximal encoding sets in S through erasure links in the first round of coded transmission 
phase. Collect feedback and update A, denoted by A'. 

3) Calculate the absolute minimum number of coded transmissions of A', the result is an instance of Vidnc 
of A. 

4) Repeat steps 2-3 M times and obtain M instances of Vidnc- 

5) Work out the numerical PDF of Vidnc using these M instances. 

The approximate PDF of Vidnc of this A is calculated using (16) and is compared with the numerical PDF 
of Vidnc- The mean-squared-error (MSE) of the approximate PDF is then obtained. This simulation is repeated 
for 10 3 randomly generated A. The average MSE with system parameters Kt = 15, N = 10, P e = 0.2 and 
M = 10 5 is 2.44 x 10~ 4 . Similar simulations on RLNC show an average MSE of 2.13 x 10~ 4 . Both approximation 
errors are acceptable because the theoretical PDF of Vrlnc (which will be derived next) is exact. Therefore, the 
approximate PDF of Vidnc that we derived in (16) is reliable. Fig. 3 demonstrates the simulation results for a 
random sample of A. 

C. The Distribution of Vrlnc 

Similar to the case of IDNC, we also consider the probability that a receiver R n still needs V n transmissions 
after the first round comprising ?7rlnc = ^Vmax coded transmissions. Denote this probability by Pr(V^), where 
V n ranges from to W n . Generally, it can be calculated as: 

YZ=W n ( W r) (Pe) W ™- W (1 - PeT , V = 

Pr(V n = V) = I (17) 

k (w:-v)( P e) W -- Wn+V (l ~ Pe) Wn ~ V , V € [1, W n ] 

To demonstrate this equation, we consider the cases of V n = 0, 1, 2. Then we employ them to derive the 
distribution of Vrlnc- 

• Pr(V n = 0) is the probability that R n has received at least W n encoded packets during W max RLNC 
transmissions. Hence the first line of Eq. (17). Then the probability that no receiver needs more transmissions 
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No. of extra transmissions, V 

Fig. 3: Theoretical and numerical PDF of Vidnc and Vrlnc for a randomly generated SFM A (Kt = 15, P e = 0.2). 



after the first round of coded transmission is simply 

N 

Pr(V R LNC = 0) = n Pr ( y « = °) < 18 ) 
n=l 

• Pr(F n = 1) is the probability that R n has received W n — 1 encoded packets during Winax RLNC transmis- 
sions. Similar to the operations in the IDNC case, we can write: 

Pr(Fn = 1} = Qj^) (p e )W—W.-i)(i _ Pe) w,-i (19) 
Pr(y n < 1) = Pr(V n = 0) + Pr(F n = 1) (20) 

N 

Pr(^RLNC < 1) = II Fl ( Un ^ X ) (21) 

n=l 

where Pr(VRLNC ^ 1) 1S the cumulative probability that at most one more transmission beyond J/rlnC is 
needed by the system. 

• Iteratively, we can compute Pv(V n Sj 2) and so on up to Pr(V^, ^ W n ). Notice that Pr(V n Sj W n + 1) = 
• • • = Pr(V n ^ £/rlnc) = 1- Then the CDF Pr(VRLNC ^ V) f° r anv V G [0, C/rlnc] can be calculated and 
there is no approximation involved for RLNC. 

Consequently, the theoretical PDF of Vrlnc can be calculated through subtracting adjacent CDF values of Vrlnc 
in the same way as in (16). This PDF is exact. An example of the theoretical and numerical PDF of Vrlnc are 
plotted in Fig. 3 and they well match each other. 



Pr(i/ IDNC = U) 



(22) 



Pr(^R LNC = U) = { 



(23) 
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D. Numerical Comparison 

In this subsection, we combine the number of transmissions in the first and second rounds of coded transmission, 
which is denoted by i/iDNC and #rlnc f° r IDNC and RLNC schemes, respectively. The PDF of H reveals the 
relative throughput performance of IDNC and RLNC. For a given SFM A, the PDF of -f/roNC is expressed as: 

^0, U G [0, C/idnC - 1] 

Pr(FlDNC = U- f/iDNC), U G [?7lDNC, 2C/lDNc] 

Similarly, the PDF of #rlnc i s: 

0, U G [0, f/ RL NC - 1] 

Pr(V R LNC = C/ - t/RLNc), ?7 G [t/RLNC, 2[/ RL Nc] 

We then numerically obtain the average PDF of both .?/idnc and #rlno The results for a system with 
K T = 15, P e = 0.2 and N = 5,15,20,30 are plotted in Fig. 4. When N is as small as 5, the throughput 
performance of IDNC is almost the same as that of RLNC. When N is equal to Kt, there is only marginal 
throughput performance gain of RLNC over IDNC. When iV further increases, the gap between them grows. The 
#idnc having the largest PDF value under N = 30 is 9, while the -Hrlnc having the largest PDF value under 
N = 30 is 7, indicating that IDNC in general may require 2 more coded transmissions than RLNC after the 
systematic phase. Given the better decoding delay performance of IDNC to be examined in the next section, we 
conclude that IDNC is a good alternative to RLNC when N is relatively smaller than Kt- Recall that the gap 
between C/idnc an d £^rlnc under N = 30 is 1.5, as shown in Fig. 2. We further conclude that the gap between 
^idnc and C^rlnc is the dominant factor on the gap between #idnc an d -ffRLNC* while the gap between Vidnc 
and Vrlnc plays a secondary role. 

VI. The Advantage of IDNC over RLNC: Packet Decoding Delay Analysis 

In the previous section, by showing that RLNC in general requires less block completion delay than IDNC, we 
confirmed that the throughput performance of IDNC never exceeds RLNC. In this section, we consider another 
type of delay, called packet decoding delay, for which IDNC may be better than RLNC. It is defined as follows: 

Definition 14. Packet decoding delay is the number of transmissions after which a wanted original data packet 
can be decoded by a receiver. 
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No. of transmissions after the systematic phase, H No. of transmissions after the systematic phase, H 
(a) N=5 (b) N=15 




No. of transmissions after the systematic phase, H No. of transmissions after the systematic phase, H 
(c) N=20 (d) N=30 

Fig. 4: Throughput comparison between IDNC and RLNC with Kt = 15, P e = 0.2 and different number of receivers N. 

This type of delay metric is important for determining IDNC and RLNC performance for intermediate packet 
delivery and especially in applications where packet delivery can occur regardless of packet order, such as 
multiple-description source coding [9], [11], [17]. 

Because IDNC and RLNC share the systematic transmission phase, we only consider original data packets 
which are still wanted after the systematic transmission phase, i.e. packets in Vk- Furthermore, which packets 
will be still wanted after the first round of coded transmission phase depend on the channel erasure patterns 
during this round, which makes the decoding delay analysis intractable. Therefore, we only consider the first 
round of coded transmission phase and thus study the packet decoding delays in the closed interval [1, t/rDNc] 
for IDNC and [1, C/rlnc] for RLNC. 

We then define a type of conceptual packet called algorithmic packet to enable the calculation of packet 
decoding delay. Its intuition comes from the fact that if an original packet has T targeted receivers, its decoding 
delay at these T receivers should be counted separately. 

Definition 15. Algorithmic packets corresponding to an original data packet, P^, are the duplicates of that packet 
at its targeted receivers. The receiving probabilities of these packet duplicates are independent of each other. 
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Given an SFM A, there are K original packets with Ti, • • • , Tk targeted receivers. Thus there are in total 
?totai = 2~2k=i T k algorithmic packets. They are denoted by Q = {Qi,Q 2 , ■■■ , Qr totlil }, where Q±, - ,Q Tl are 
duplicates of Pi at the receivers in Ti, Qti+i, ■ ■ ■ , Qt x +t 2 are duplicates of P2 at the receivers in T2, and so on. 
Their decoding delays are denoted by Zi, ■ • • , Zr total , respectively, whose values can range from 1 to C/idnc f° r 
IDNC and from 1 to £7rxnc f° r RLNC, respectively. In this section we aim to propose useful metrics to measure 
the decoding delay of these algorithmic packets and investigate how IDNC could outperform RLNC in terms of 
these metrics. As a starting point, we calculate the average packet decoding delay: 

L = ^Jh. (24) 

1 total 

Denote the number of decoded algorithm packets in the u-th transmission by D u , where u ^ J/idnc or 
u ^ ZTrlnc- Then, the average packet decoding delay can also be calculated as: 

L = E y UDu (25) 

1 total 

where U = ?7idnc or C/rlnc- In the best case scenario where channel links are erasure-free, average packet 
decoding delay of both IDNC and RLNC can be easily calculated and compared given the SFM A. Below is an 
example: 

Example 4. 

Consider the SFM A in Example 3 and its minimal collection S = {{Pi, P3}, {-P4, Pq}, {P2, P3, P$}}- There 
are T tota i = 13 algorithmic packets. We then demonstrate the decoding progress of IDNC and RLNC in the first 
round of coded transmission phase, respectively, assuming erasure-free packet reception at all receivers: 

• IDNC: The first transmission will provide T\ + T3 = 5 algorithmic packets. The second transmission will 
provide T4 + Tq = 5 algorithmic packets. The final transmission will provide T2 + T5 = 3 algorithmic packets 
(notice that P3 has already been decoded in the first transmission). The average packet decoding delay of 
this A if IDNC is employed will be: 

5x1+5x2+3x3 

MDNC = ~ 1-00 

which means each algorithmic packet needs to wait about 1.85 transmissions for decoding. 

• RLNC: In the first transmission, no receiver can decode anything because there is no receiver who wants 
only one packet. In the second transmission, both R2 and R$ can decode 2 algorithmic packets, in total 4 
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algorithmic packets. In the final transmission, R±, R3 and R4 can decode 3 algorithmic packets respectively, 
in total 9 algorithmic packets. The average packet decoding delay of this A if RLNC is employed will be: 

Ox 1 + 4x2 + 9x3 o _ 

-kRLNC = ^ ~ 2.7 

which means each algorithmic packet needs to wait about 2. 7 transmissions for decoding. It is 46% higher 
than Lidno 

In the presence of link erasures, computing the exact value of D u is intractable. Instead, we determine the 
statistical mean of D u , denoted by E[D U ]. Then the mean of average packet decoding delay is defined as: 



YZ=iuE[Du] 

TH =1 e[d u } 



e[l\ = t^ 1 zt. 



where U = C/tdnc or JTrlnc- We derive E[D U ] for both IDNC and RLNC in the next subsection. 

A. E[D U ] of IDNC and RLNC 
1) E[D U ] of IDNC: 

Consider an SFM A and a given minimal collection S = {M\, M2, ■ ■ ■ ,M.u mNC } (for example, the one that 
has the largest average diversity according to (9) and has its maximal encoding sets reordered to benefit most 
receivers first). In the u-th transmission, packets in M u are encoded using F2 and transmitted. For each packet 
Pk> € M-u, its number of targeted receivers is denoted by TV and its diversity within the group of already 
transmitted maximal encoding sets {Mi, ■ ■ ■ ,M U } is denoted by d^(u). Consequently, each packet provides 
TV algorithmic packets, sharing independent and identical probability of being decoded by its targeted receiver: 

Pr(P k , decoded at u) = - P e ) (27) 

which means that in the first dk>(u) — 1 transmissions about P^, a targeted receiver always experienced erasures. 
But in the u-th transmission, this receiver does not experience erasure. Consequently, the mean number of decoded 
algorithmic packets, E[D U ], is: 

E[D U ] = T k' Pr ( p fe' decoded at u) (28) 

k>:P k ,eM u 

Remark 5. From (27) and (28), the significance of packet diversity and early transmission of packets requested 
by more receivers on the IDNC decoding delay performance should become apparent. The larger the packet 
diversity (u), the higher the probability that a receiver has already successfully decoded its requested packet 
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by time slot u. This generally enhances IDNC decoding delay performance and is the main reason why the sender 
should send maximal encoding sets (rather than disjoint sets) in a minimal collection with the largest average 
diversity across all receivers as suggested by (9) and should also send maximal encoding sets that target most 
receivers first. 

2) E[D U ] of RLNC: 

For a receiver R n £ 1Z who wants W n ^ n original data packets, the probability that it can decode W n packets 
in the n-th transmission is equal to the probability that it did not experience erasures in W n — 1 out of the first 
u — 1 transmissions and is also not in erasure in the n-th transmission: 

Pr(W n ,u) = (^~_\)pr W "(l " Pe) W " (29) 

If R n can decode in the n-th transmission, it will generate W n decoded algorithmic packets. Thus the mean 
number of decoded packets that R n generates in the n-th transmission is W n Pr(W n , n). Consequently, the mean 
of D u across all receivers is: 

E[D U }= £ W n Vr{W n ,u) (30) 

To this end, we have derived the E[D U ] of a given SFM A under both IDNC and RLNC schemes. By 
substituting them into (26) we obtain the average packet decoding delay ^[Lidnc] an d -E^rlnc] of this A. 
This metric could be a useful criterion for adaptively choosing between IDNC or RLNC after the systematic 
network coding phase. 

Remark 6. Although we have only computed the mean decoding delay across receivers, (27) and (29) completely 
characterize probabilistic decoding delay performance of IDNC and RLNC. For example, it is possible to compute 
higher order statistics of the decoding delay. 

B. The impact of number of receivers on the average packet decoding delay 

In this subsection, we investigate how the number of receivers, N, impacts the decoding delay performance 
of IDNC and RLNC schemes. Because the derivation of both £^[Lidnc] an d -E[£idnc] is based on a specific A, 
the investigation is partly through simulations. For a given set of system parameters Kt, P e and N, M samples 
of A are randomly generated and their ^[-Lidnc] and ^[Lrlnc] are then analytically calculated. The average 
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of the M values of /^[Ltdnc] is the average E'fLrDNc] under the current set of system parameters and so is the 
average E[L RhNC ]- 

Fig. 5 shows the numerical results of the average -E[Lidnc] an d -E* [-^rlnc] with parameters Kt = 15, P e = 0.2, 
M = 10 5 and N ranging from 1 to 45 with a step of 2. Our first observation is that the decoding delay profile of 
IDNC is similar to the throughput delay profile of IDNC. That is, /^[Ltdnc] increases almost linearly with N, 
from 2.7 when N = 5 to 4.7 when N = 45. Meanwhile, the average packet decoding delay of RLNC increases 
slowly with N and the increment is marginal when N is greater than Kt- The second observation is that, when 
N is smaller than Kt, the average decoding delay of RLNC is between 30% to 50% greater than IDNC. It 
indicates a significant decoding delay performance gain of IDNC over RLNC. This gain decreases gradually with 
N and both IDNC and RLNC have a average decoding delay of 4.2 when N = 33, which is just above twice 
of Kt- After that, the decoding delay performance of RLNC is better than IDNC. Interestingly, we can attribute 
the better performance of RLNC over IDNC in the last regime where N is significantly larger than Kt to its 
better throughput (block completion delay) performance as observed in Fig. 2. In other words, when N » Kt, 
^idnc > ^rlnc> which postpones the decoding opportunity of some packets to much further time slots in IDNC. 
This clearly shows the interplay between throughput and decoding delay in network coded schemes, which is 
often very hard to characterize. 

Recall that IDNC is a good alternative to RLNC in terms of throughput when N is relatively smaller than Kt- 
Considering both the throughput and decoding delay performance, we make the following three conclusions from 
a statistical perspective: 1) When N is relatively smaller than K T , IDNC is more preferable than RLNC; 2) When 
N is roughly in the range [Kt,2Kt], IDNC and RLNC lead in decoding delay and throughput, respectively, 
which scheme to choose depends on the specific application; and 3) When ./V is larger than 2Kt, RLNC is a 
better choice. 

VII. Conclusion 

In this paper, we provided a systematic framework for throughput and decoding delay analysis and comparison 
of random linear network coding (RLNC) and instantly decodable network coding (IDNC) schemes in erasure- 
prone packet broadcast scenarios. Our approach was based on analyzing the two schemes under the same packet 
reception status after the systematic transmission phase. For IDNC, which was the harder problem to solve, this 
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Fig. 5: Average packet decoding delay of IDNC and RLNC versus N (Kt = 15, P e = 0.2). 



was enabled through a comprehensive graph theoretical study of the IDNC model and its encoding process. In 
particular, we presented easily-calculable upper bound and lower bounds on the minimum number of coded 
transmissions, {/idnc based on the packet conflict matrix. We also presented an optimum IDNC encoding 
algorithm to minimize [/idnc- We introduced the notion of packet diversity, which we showed to be a key 
factor in determining the probability of needing residual coded transmissions beyond C/idnc an d i ts decoding 
delay performance. Similar throughput and decoding delay metrics were derived (with more ease) for the RLNC 
counterpart. Finally, we provided insights into the role of feedback frequency on the complexity and overall 
throughput performance of IDNC. 

This comparative framework is useful for selecting between IDNC and RLNC given a specific instance of 
the packet reception status (SFM A) depending on the specifically obtained throughput and delay performance 
metrics for each scheme. Furthermore, it enables statistical study of the impact of number of receivers on the 
average throughput and decoding delay. For example, we observed that both the throughput and decoding delay 
performance of IDNC degrades gradually as the number of receivers increases, while such degradation is much 
weaker in RLNC. Furthermore, IDNC is more preferable than RLNC when the number of receivers is smaller 
than the size of packet block, and the case reverses when the number of receivers is much greater than the size 
of packet block. In the middle regime, there is no clear winner as the throughput of RLNC is higher while the 
decoding delay of IDNC is lower. 
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Appendix A 
Proof of Theorem 2 

We prove that if there is a collection S = {Mi, • • • , Mu} satisfying the diversity constraint, there is also a 
[/-coloring solution of Q and vice versa. An eligible [/-coloring solution is denoted by {/Ci, • • • , /Q/} and has 
the following three properties: 1) the vertices in the same set /Q share the same color; 2) any two of vertices in 
the same ZCj are not connected; 3) the intersection between any two sets /Q and K.j is empty. 

If S = {Mi,-- - ,Mu} satisfies the diversity constraint, we can always construct a new group of vertex 
sets as follows, which also satisfy the diversity constraint: Set ¥i\ = Mi, then sequentially /C2 = M2 \ ZCi, 
K.3 = M3 \ [J{/Ci, /C2}, • • • , K-u = Mu \ U£i 1 {^}- Following this construction process, it is clear that the 
intersection between any two sets /Q and )Cj is empty, thus property 3) is satisfied. Because /Q C Mi, every 
vertex in the same /Q is connected to each other under Q. Therefore, they are all disconnected under Q and 
property 2) is satisfied. Hence, if we assign U colors to Ki,- ■ ■ ,)Cu, they form a [/-coloring solution of Q. 

Conversely, if {/Ci, ■ ■ ■ ,JCu} is a [/-coloring solution of Q, it is also a minimum clique cover solution of Q. 
For each ZQ, we can find a maximal clique Mi where ZQ C Mi, then S = {M\, • • • , Mu} satisfies the diversity 
constraint. 

Hence, the minimum collection size of the conflict matrix C equals the chromatic number of Q. 

Appendix B 
Proof of Theorem 3 

Suppose M is a maximal clique of Q and the chromatic number of Q is U. Assuming the chromatic number 
of Q' = Q \ M is U' < U — 1, there exists at least one [/'-coloring solution of Q', denoted by {/Ci, • • • ,JCu>}- 
If this is the case, {/Ci, • • • ,Ku>, M} is a eligible ([/' + l)-coloring of Q and U' + 1 < U, contradicting the 
assumption that the chromatic number of Q is U. Thus the chromatic number of Q' is at least U — 1. 

The chromatic number of Q' will still be U if the removed M does not belong to any minimal collection of 
C. This is equivalent to saying that there is no K, C M in any [/—coloring solution of Q, because otherwise we 
could obtain a minimal collection S of C comprising M using the [/-coloring solution, which includes this JC 
in the way we described in Appendix A. Hence, there are at least two colors involved in the vertices of M and 
there are vertices outside M with the same colors. Hence, removing vertices in M from Q cannot reduce the 
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chromatic number of the resulted Q' to less than U. 



Appendix C 
An Example for Uf$$? < tf^c onlinc 
Consider an initial SFM A with three wanted packets and three receivers: 
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and a channel erasure pattern with 5 future transmissions: 
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where u is the transmission time slot index, X means erasure and O means successful packet reception. 
We then describe the transmission progress in both semi-online feedback and fully-online feedback cases: 
• Semi-online feedback 

1) P 1 ,P 2 and P 3 all conflict with each other. Thus S = {{Pi}, {P 2 }, {P3}} and U^ DNC = 3 in the first 
coded transmission round. P\ is transmitted first, followed by P2 and P3. 

2) After 3 transmissions, the sender collects feedback and the updated A' is: 



Pi 


P2 


^3 




1 








Ri 


1 





1 


R2 











Rs 



We note that W2 = 2 and it can be easily verified that fT^NC = 2 in the second coded transmission 
round. Then Pi is transmitted again in the fourth transmission, followed by P3. After that, all the three 
receivers' demands are satisfied. 
Thus the semi-online feedback scheme requires U^^q online = Ul DNC + Uj DNC = 5 coded transmissions. 
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. Fully-online feedback 

1) After transmitting P\ in the first transmission, the sender collects feedback and the updated A' is equal 
to A; 

2) Thus in the second transmission, Pi will be transmitted again. The sender then collects feedback and the 
updated A' is: 

Pi P2 Ps 



1 

A' = 



Ri 
R2 



1 
1 1 

3) In the third transmission P2 is transmitted because S' = {{-P2}, {-P3}} for A'. P2 will be received by 
both Ri and R% in this transmission and thus the only wanted packet after that is P3. 

4) In the fourth transmission, P3 is transmitted and will be successfully received by both R2 and P3. 

Thus fully-online feedback scheme requires U°^q = 4 c °ded transmissions, which is smaller than U^^q ° nlm e 
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